Microphone front-ends for spatial sound analysis and synthesis with Directional Audio Coding

Authors

  • Jukka Ahonen
  • Ville Pulkki
Abstract

Aalto University, P.O. Box 11000, FI-00076 Aalto, www.aalto.fi

Author: Jukka Ahonen
Name of the doctoral dissertation: Microphone front-ends for spatial sound analysis and synthesis with Directional Audio Coding
Publisher: School of Electrical Engineering
Unit: Department of Signal Processing and Acoustics
Series: Aalto University publication series DOCTORAL DISSERTATIONS 33/2013
Field of research: Acoustics and Audio Signal Processing
Manuscript submitted: 17 September 2012
Date of the defence: 8 March 2013
Permission to publish granted (date): 14 December 2012
Language: English
Monograph / Article dissertation (summary + original articles)

Abstract

A large number of professional and domestic audio applications utilize spatial sound reproduction. In addition to the conventional applications, such as the surround sound in movie and home theaters, spatial sound is also applied for telecommunication purposes. For instance in teleconferencing, sound emanated by talkers can be captured with multiple microphones at one end and reproduced spatially distributed with multiple loudspeakers at the other. This has benefit over a typical monophonic reproduction of the teleconference in terms of speech intelligibility and other elements of communication.

During the last decade there has been an increasing research interest in parametric spatial sound processing. Several techniques for estimating the directional parameters of a sound field from multichannel audio files or from microphone signals have been proposed. In the parametric techniques, the directional information can be efficiently transmitted and then applied to spatial sound synthesis for various purposes. This thesis discusses Directional Audio Coding (DirAC) for capturing, transmitting and reproducing spatial sound. The perceptually motivated time-frequency processing of DirAC provides a parametric description of spatial sound, namely the arrival direction and diffuseness of sound. Direction and diffuseness, when analyzed in the time-frequency resolution of human hearing, are assumed to transmit enough information on the captured sound field for spatial hearing. DirAC has several applications of spatial audio, of which teleconferencing is mainly the focus here.

The author's research addresses the development of different microphone front-ends for DirAC. The methods to analyze a sound field with input from arrays of omnidirectional microphones and from typical directional stereo microphones were studied. A novel method for diffuseness estimation was developed as a part of this work. Microphone arrays, which exploit an acoustic shadowing between microphones, are also proposed as an acoustical frontend for DirAC, as are the methods to conduct directional analysis with such arrays. These methods overcome the issues, which occur in direction analysis with input from the conventional microphone arrays, and thus provide reliable direction estimate over the entire audio frequency range. In the thesis, DirAC processing is also applied to bilaterally-fitted hearing aids with two microphones at each ear. The use of different microphone front-ends is evaluated through measurements and listening tests.
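
As an illustration of the two parameters named in the abstract (arrival direction and diffuseness), the following is a minimal sketch of an energetic analysis for a single frequency band. It is not taken from the thesis, whose subject is other microphone front-ends. It assumes horizontal B-format input with W scaled by 1/sqrt(2), and the function name `dirac_analyze_band` and its arguments are hypothetical.

```python
import math


def dirac_analyze_band(w, x, y):
    """Return (azimuth in radians, diffuseness in [0, 1]) for one band.

    Assumption: w, x, y are complex short-time spectra of horizontal
    B-format signals over a few frames, with W scaled by 1/sqrt(2).
    """
    # Time-averaged Re{conj(W) * X} and Re{conj(W) * Y}. Under the B-format
    # encoding convention this vector points toward the source; it is the
    # negated active intensity up to a constant factor.
    ix = sum((wc.conjugate() * xc).real for wc, xc in zip(w, x))
    iy = sum((wc.conjugate() * yc).real for wc, yc in zip(w, y))
    # Time-averaged energy, scaled consistently with the vector above.
    energy = sum(abs(wc) ** 2 + 0.5 * (abs(xc) ** 2 + abs(yc) ** 2)
                 for wc, xc, yc in zip(w, x, y))
    azimuth = math.atan2(iy, ix)
    if energy == 0.0:
        # Silent band: reported as fully diffuse (a choice made in this sketch).
        return azimuth, 1.0
    diffuseness = 1.0 - math.sqrt(2.0) * math.hypot(ix, iy) / energy
    # Clamp to absorb rounding error.
    return azimuth, min(1.0, max(0.0, diffuseness))
```

For a single plane wave the estimated diffuseness is close to 0 and the azimuth matches the source direction. When energy arrives equally from opposite directions across the averaged frames, the averaged vector cancels and the diffuseness approaches 1.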

Similar resources

Broadband analysis and synthesis for Directional Audio Coding using A-format input signals

Directional Audio Coding (DirAC) is a parametric non-linear technique for spatial sound recording and reproduction, with flexibility in terms of loudspeaker reproduction setups. In the general 3-dimensional case, DirAC utilizes as input B-format signals, traditionally derived from the signals of a regular tetrahedral first-order microphone array, termed A-format. For high-quality rendering, the...

Applications of Directional Audio Coding in Audio

Directional Audio Coding (DirAC) is a method for spatial sound representation, applicable to different sound reproduction systems. In the analysis part, the diffuseness and direction of arrival of sound is estimated in a single location depending on time and frequency. In the synthesis part, microphone signals are first divided into non-diffuse and diffuse parts, and are then reproduced using d...

Directional audio coding - perception-based reproduction of spatial sound

Directional Audio Coding (DirAC) is a perceptually motivated technique for spatial audio processing. DirAC analyzes in short time windows the sound spectrum together with direction and diffuseness in frequency bands of human hearing, and uses this information in synthesis. It has applications in capturing, coding and resynthesis of spatial sound, in teleconferencing, in directional filtering, a...

On the Use of Small Microphone Arrays for Wave Field Synthesis Auralization

The synthesis of a captured sound field with preservation of its perceptual properties is called auralization. Data-based Wave Field Synthesis (WFS) auralization makes use of a set of measured impulse responses along an array of microphone positions. However, a considerable array size must be employed for having an appropriate angular resolution. In this paper, we explore the possibilities of t...

Parametric Spatial Audio Effects

Parametric spatial audio coding methods aim to represent efficiently spatial information of recordings with psychoacoustically relevant parameters. In this study, it is presented how these parameters can be manipulated in various ways to achieve a series of spatial audio effects that modify the spatial distribution of a captured or synthesised sound scene, or alter the relation of its diffuse a...


Publication date: 2013